Skip to main content

All Questions

5votes
1answer
49views

How to do Exploratory Data Analysis when my response variable is binary?

I am doing a multilevel regression, and my response variable is binary (presence of females on a tech board). all the EDA methods i know are about plotting correlation, but this as this is a binary i ...
Anya's user avatar
0votes
1answer
88views

Why would a model for a dataset produce different root mean square error results?

I'm trying to get a Data Scientist Associate certificate from DataCamp. I have a problem with the sample exam provided to practice for the Practical Exam. I have provided the details below. I'd really ...
astrobiologist's user avatar
0votes
1answer
40views

Partial Least Squares model returning negative values for curve or spline like output

I have a dataset which measures the different altitude of a basketball thrown at 4 intervals, 1 second, 2 seconds, 3 seconds, 4 seconds. The features are weight, height, years in sports participation. ...
ChairmanMeow's user avatar
0votes
0answers
38views

P value for partial least squares regression in python

is there a way to calculate Pvalues for Partial Least Squares (PLS)? Or any preferred method to evaluate coefficient significance for PLS method? Preferably done in python
Borla312's user avatar
1vote
1answer
88views

How to model a polynomial function with many features and multiple outputs

With linear regression (and multiple linear regression), one can have many features but only one output as y. I need to build a model with multiple x's that also outputs multiple y's in a polynomial ...
Borla312's user avatar
0votes
1answer
56views

How can I improve my predictive model?

Here is my interpretation of my model so far, I am investigating the relationship between ratings and followers on video games, but there is a problem. The more you get high ratings, the more you get ...
Hugo Guay's user avatar
0votes
1answer
116views

Build a Neural Network for Multi-output Regression

I have a network model that accepts about 25 inputs and outputs 3 actions. The outputs are: delta X and delta Y of the robot and the angle of the robot. After I enter the data into the model, I get ...
May's user avatar
3votes
1answer
66views

When do you know training a model is not feasible?

A couple weeks ago I volunteered to take on a project at work to try predicting the ideal price rental cars my employer should be charging based on our historical rental data. The variables available ...
enmasse's user avatar
0votes
1answer
86views

What's the best model choice for a non-linear regression task?

I have a dataset with the following format: Rows: 3700, float_columns: 17, int_columns: 2, categorical_columns: 12 Target Type: Continous, float My dataset is an insurance dataset that stores the ...
Connor's user avatar
2votes
2answers
68views

Handling Month-over-Month data in Regression Model

I have data similar to what you see in the picture. I want to use a RandomForest Regression model where I can use fields (excluding MONTH_END_DT and LOCATION_ID) to predict REVENUE_PER_UNIT. The idea/...
Larry Burholme's user avatar
0votes
1answer
336views

Effect on regression coefficients by multiplying a constant to a feature

I was solving one quiz question on Coursera and I found an interesting question. If you double the value of a given feature (i.e. a specific column of the feature matrix), what happens to the least-...
teddcp's user avatar
0votes
3answers
1kviews

Best model for regression in this case?

I am doing some modeling to predict a variable of interest given a big set of features (500) for which I expect a considerable amount of interactions happening at least among some of them. I first ...
Mirko's user avatar
1vote
1answer
137views

improving Neural network regression model

I have the following toy data (which closely mimicks my original larger data used for the project): ...
Ayan Mitra's user avatar
0votes
1answer
73views

How to treat missing values depending on what missing means

I have a dataset with quotes from an insurance company. I am trying to create a model to predict how much should the company charge the customer according to the different variables. Two of the ...
Javi's user avatar
  • 103
0votes
0answers
22views

Classification-Problem based on limited Dataset (Need keywords to search/ reading recommendations)

I have a dataset of about 200 test subjects, each with age, testscore and a one of two possible traits. I want to define some kind of function, where I input age and testscore, that predicts which of ...
inane's user avatar

153050per page
close